SEQUENCE LISTING 



( 1 ) GENE] 



INFORMATION 



(i) APPLICANT: LaVALLIE, EDWARD 

RACIE , LISA 

(ii) TITL^VOF INVENTION: HUMAN SDF-5 PROTEIN AND COMPOSITIONS 
(iii) NUMBERXoF SEQUENCES: 3 



(iv) 



CORRESP 
(A) AD 
(B) 
(O 
(D) 
(E) 
(F) 



)NDENCE ADDRESS : 

ESSEE: GENETICS INSTITUTE, INC 
STREET: 8 7 CAMBRIDGE PARK DRIVE 
CITY\ CAMBRIDGE 
STATEi: MA 
COUNTRY : USA 
ZIP: 0fel40 



<v) COMPUTER READABLE FORM: 

(A) MEDIUM TOPE: Floppy disk 

(B) COMPUTER^ IBM PC compatible 

(C) OPERATING\SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE : Vatentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA 

(A) APPLICATION NUMBER 

(B) FILING DATE: 

(C) CLASSIFICATIOl 




(viii) ATTORNEY/AGENT INFORMATION 

(A) NAME: LAZAR, STEVEN R 

(B) REGISTRATION NUMBER: 32,618 

ix) TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: (617) 49St-8260 

(B) TELEFAX: (617) 876-5851 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 02 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GAATTCGGCC TTCATGGCCT AGCTCATTCT GCTCCCCCGG GTCGC^GCCC CCCGGAGCTG 
CGCGCGGGCT TGCAGCGCCT CGCCCGCGCT CCTCCCGGTG TCCCGCTOCT CCGCGCCCCA 



GCCGCCGGCT GCCAGCTTTT CGGGGCCCCG AGTCGCACCC AGCGAAGA( 



GCGGGCCCGG 



GACAAGCTCG AACTCCGGCC GCCTCGCCCT TCCCCGGCTC CGCTCCCTCT \3CCCCCTCGG 

GGTCGCGCGC CCACGATGCT GCAGGGCCCT GGCTCGCTGC TGCTGCTCTT COJCGCCTCG 

/ 

CACTGCTGCC TGGGCTCGGC GCGCGGGCTC TTCCTCTTTG GCCAGCCCGA CTT&TCCTAC 

AAGCGCAGCA ATTGCAAGCC CATCCCGGCC AACCTGCAGC TGTGCCACGG CATCGAATAC 



60 
120 
180 
240 
300 
360 
420 
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CAGAACATGQ GGCTGCCCAA CCTGCTGGGC CACGAGACCA TGAAGGAGGT GCTGGAGCAG 
GCCGGCGCTT GGATCCCGCT GGTCATGAAG CAGTGCCACC CGGACACCAA GAAGTTCCTG 
TGCTCGCTCT TCGCCCCCGT CTGCCTCGAT GACCTAGACG AGACCATCCA GCCATGCCAC 
TCGCTCTGCG TGOAGGTGAA GGACCGCTGC GCCCCGGTCA TGTCCGCCTT CGGCTTCCCC 
TGGCCCGACA TGCTTGAGTG CGACCGTTTC CCCCAGGACA ACGAC CTTTG CATCCCCCTC 
GCTAGCAGCG ACCACqTCCT GCCAGCCACC GAGGAAGCTC CAAAGGTATG TGAAGCCTGC 
AAAAATAAAA ATGATGATGA CAACGACATA ATGGAAACGC TTTGTAAAAA TGATTTTGCA 
CTGAAAATAA AAGTGAAQGA GATAACCTAC ATCAACCGAG AT AC C AAAAT CATCCTGGAG 



ACCAAGAGCA AGACCATT' 



CAAGCTGAAC GGTGTGTCCG AAAGGGACCT GAAGAAAT CG 



GTGCTGTGGC TCAAAGACAGV CTTGCAGTGC ACCTGTGAGG AGATGAACGA CATCAACGCG 

CCCTATCTGG TCATGGGACA GAAACAGGGT GGGGAGCTGG TGATCACCTC GGTGAAGCGG 

TGGCAGAAGG GGCAGAGAGA GTTCAAGCGC ATCTCCCGCA GCATCCGCAA GCTGCAGTGC 

TAGTCCCGGC ATCCTGATGG CTOCGACAGG CCTGCTCCAG AGCACGGCTG ACCATTTCTG 

CTCCGGGATC TCAGCTCCCG TTCCfcCAAGC ACACTCCTAG CTGCTCCAGT CTCAGCCTGG 

GCAGCTTCCC CCTGCCTTTT GCACGTOTGC ATCCCCAGCA TTTCCTGAGT TATAAGGCCA 



CAGGAGTGGA TAGCTGTTTT CACCTi 



lGG AAAAGCCCAC CCGAATCTTG TAGAAATATT 



CAAACTAATA AAATCATGAA TATTTTTA: 



AAGTTTAAAA ATAGCTCACT TTAAAGCTAG 



TTTTGAATAG GTGCAACTGT GACTTGGGTC \TGGTTGGTTG TTGTTTGTTG TTTTGAGTCA 

GCTGATTTTC ACTTCCCACT GAGGTTGTCA TAACATGCAA ATTGCTTCAA TTTTCTCTGT 

GGCCCAAACT TGTGGGTCAC AAACC CTGTT GAGATAAAGC TGGCTGTTAT CTCAACATCT 

TCATCAGCTC CAGACTGAGA CTCAGTGTCT AAGTOTTACA ACAATTCATC ATTTTATACC 

TTCAATGGGA ACTTAAACTG TTACATGTAT CACATTOCAG CTACAATACT TCCATTTATT 




GCACAT TAACCATTTC TAT AG C ATG A TTTCTTGi 



TAAAAGGCAA AAGATATAAA 



TTTTATAATT QACTTGAGTA CTTTAAGCCT TGTTTAAAAC ^TTTCTTACT TAACTTTTGC 
AAATTAAACC CATTGTAGCT TACCTGTAAT ATACATAGTA GTTTACCTTT AAAAGTTGTA 



AAAATATTGC TTTAACCAAC ACTGTAAATA TTTCAGATAA ACATTATATT CTTGTATATA 
AACTTTACAT CCTGTTTTAC CTAAAAAAAA AAAAAAAAAG CGGCC< 

* 

(2) INFORMATION FOR SEQ ID NO : 2 : 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2027 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
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Met Leu Gin Gly Pro Gly Ser Leu Leu Leu Leu Phe Leu Ala Ser His 
15 10 15 



:ys Cys Leu Gly Ser Ala Arg Gly Leu Phe Leu Phe Gly Gin Pro Asp 

20 25 30 

Phe\ser Tyr Lys Arg Ser Asn Cys Lys Pro lie Pro Ala Asn Leu Gin 
35 40 45 

Leu Cyh His Gly lie Glu Tyr Gin Asn Met Arg Leu Pro Asn Leu Leu 
50 \ 55 60 

Gly His telu Thr Met Lys Glu Val Leu Glu Gin Ala Gly Ala Trp lie 
65 \ 70 75 80 

Pro Leu Va\ Met Lys Gin Cys His Pro Asp Thr Lys Lys Phe Leu Cys 

85 90 95 

Ser Leu Phe A\a Pro Val Cys Leu Asp Asp Leu Asp Glu Thr lie Gin 

10\) 105 110 

Pro Cys His Ser \Leu Cys Val Gin Val Lys Asp Arg Cys Ala Pro Val 
115 \ 120 125 

Met Ser Ala Phe GlV Phe Pro Trp Pro Asp Met Leu Glu Cys Asp Arg 
130 \ 135 140 

Phe Pro Gin Asp Asn Afep Leu Cys lie Pro Leu Ala Ser Ser Asp His 
145 15*0 155 160 

Leu Leu Pro Ala Thr Glu\Glu Ala Pro Lys Val Cys Glu Ala Cys Lys 

165 \ 170 175 

Asn Lys Asn Asp Asp Asp Asii Asp lie Met Glu Thr Leu Cys Lys Asn 

180 \ 185 190 



Asp Phe Ala Leu Lys lie Lys V&l Lys Glu lie Thr Tyr lie Asn Arg 
195 20\) 205 



Asp Thr Lys lie lie Leu Glu Thr 



210 



215 



jys Ser Lys Thr lie Tyr Lys Leu 
v 220 



Gly Val Ser Glu Arg Asp Leu Lys\ Lys Ser Val Leu Trp Leu Lys 

230 \ 235 240 

Asp Ser Leu Gin Cys Thr Cys Glu Glu M&t Asn Asp lie Asn Ala Pro 

245 250, 255 



Tyr Leu Val Met Gly Gin Lys Gin Gly Gly &lu Leu Val lie Thr Ser 

260 265 \ 270 



Val Lys Arg Trp Gin Lys Gly Gin Arg Glu 
275 280 

Ser lie Arg Lys Leu Gin Cys 
290 295 



Phe\ Lys Arg lie Ser Arg 
285 



INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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/ 



(xd) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



\ 



Ser\ Ala Arg Gly Leu Phe Leu Phe Gly Gin Pro Asp Phe Ser Tyr Lys 
1 \ 5 10 15 



Arg Ser Asn Cys Lys Pro lie Pro Ala Asn Leu Gin Leu Cys His Gly 

20 25 30 



\ 



lie Glu ^yr Gin Asn Met Arg Leu Pro Asn Leu Leu Gly His Glu Thr 
L 5 40 45 



Met Lys Glu Val Leu Glu Gin Ala Gly Ala Trp lie Pro Leu Val Met 
50 \ 55 60 

Lys Gin Cyk His Pro Asp Thr Lys Lys Phe Leu Cys Ser Leu Phe Ala 
65 \ 70 75 80 

Pro Val Cys \.eu Asp Asp Leu Asp Glu Thr lie Gin Pro Cys His Ser 

85 90 95 

Leu Cys Val Gl\ Val Lys Asp Arg Cys Ala Pro Val Met Ser Ala Phe 

100\ 105 110 

Gly Phe Pro Trp kro Asp Met Leu Glu Cys Asp Arg Phe Pro Gin Asp 
115 \ 120 125 

Asn Asp Leu Cys Ile\Pro Leu Ala Ser Ser Asp His Leu Leu Pro Ala 
130 \ 135 140 

Thr Glu Glu Ala Pro L}te Val Cys Glu Ala Cys Lys Asn Lys Asn Asp 
145 ISO, 155 160 



Asp Asp Asn Asp lie Met Slu Thr Leu Cys Lys Asn Asp Phe Ala Leu 

165 \ 170 175 

Lys lie Lys Val Lys Glu Ile\ Thr Tyr lie Asn Arg Asp Thr Lys lie 



180 



185 



190 



lie Leu Glu Thr Lys Ser Lys Thr lie Tyr Lys Leu Asn Gly Val Ser 
195 20CX 205 

Glu Arg Asp Leu Lys Lys Ser Val Veu Trp Leu Lys Asp Ser Leu Gin 

210 215 \ 220 



hr Cys Glu Glu Met Asn Asp IleVAsn Ala Pro Tyr Leu Val Met 

230 \ 235 240 

y Gin Lys Gin Gly Gly Glu Leu Val iPfe Thr Ser Val Lys Arg Trp 

245 25(A 255 



Gin Lys Gly Gin Arg Glu Phe Lys Arg lie £?er Arg Ser lie Arg Lys 

260 265 \ 270 

Leu Gin Cys 
275 
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V 



